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DETAILED ACTION 



Claim Rejections - 35 USC § 102 



1 . The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 

form the basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351(a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

Claims 1-3, 7-9, 13-15, 19-21, 25-27, and 29 are rejected under 35 U.S.C. 102(e) 
as being anticipated by Ahmad et al. (US006263507B1). 

Regarding claim 1, Ahmad et al. (Ahmad) discloses a control device and system 
controller or "multimedia summary generator" for use in electronic devices such as 
televisions, a computer display monitor, or "video display system" (See column 1 lines 
10-20 and column 2 lines 60-67). The control unit and system controller is able to 
summarize multiple television news programs or "video programs" within a graphical 
user interface (GUI) or "multimedia summary" (See Fig. 2A and 2B; column 4 lines 40- 
60). The control device and system controller extracts video images or "audio-video 
segments" (See column 16 line 55 - column 17 line 10) and text from a transcript (See 
column 33 line 60 - column 34 line 10) and combines the extracted pieces to provide a 
summary of the television news broadcast within the GUI or "multimedia summary of 
said video program (See Fig. 2A and 2B). 
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Regarding claims 2 and 3, the control device and system controller extracts video 
images or also known as "audio-visual template" that relate to the news stories or 
"topics" of the news program (See column 16 line 55 - column 17 line 10). The news 
programs have different stories, which are identified within the transcript by markers or 
"topic cue", and each video images represents a news story within the GUI (See Fig. 2A 
and 2B; column 23 lines 30-50). Furthermore, the control device and system controller 
executes this process, where inherently it is capable of executing computer instructions 
within a memory coupled thereto. 

Claim 7 contains the limitations of claim 1 (wherein the control device and system 
controller are embodied along with a television, a computer display monitor, or "video 
display system") and is analyzed as previously discussed with respect to that claim. 

Claims 8 and 9 contains the limitations of claims 2, 3, and 7 and is analyzed as 
previously discussed with respect to those claims. 

Claim 13 contains the limitations of claim 1 and is analyzed as previously 
discussed with respect to that claim. 

Claims 14 and 15 contains the limitations of claims 2, 3, and 13 and is analyzed 
as previously discussed with respect to those claims. 

Claim 19 contains the limitations of claim 1 and is analyzed as previously 
discussed with respect to that claim. Furthermore, Ahmad discloses that the process 
can be executed via instructions within a computer readable medium encoded with one 
or more computer programs (See column 5 lines 5-10). 
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Claims 20 and 21 contains the limitations of claims 2, 3, and 19 and is analyzed 
as previously discussed with respect to those claims. 

Regarding claim 25, the summary within the GUI has at least one video images 
of the news program (See Fig. 2A and 2B). 

Regarding claim 26, the summary within the GUI has at least one text portion of 
transcript (See Fig. 2Aand 2B; column 16 line 55 -column 17 line 10; column 33 line 
60 -column 34 line 10). 

Claim 27 contains the limitations of claims 2, 3, and 25 (wherein the GUI displays 
the summary information, including the video images representing each news story) and 
is analyzed as previously discussed with respect to those claims. 

Regarding claim 29, the GUI displays text summaries of the news program or 
"text from said video program", pictorial representations of the news program or "single 
frame", and video images or "video segment" (See Fig. 2A and 2B; column 16 line 55 - 
column 17 line 10). Furthermore, Ahmad discloses that the system is also capable of 
presenting information using audiovisual display or "audio-visual segment" (See column 
2 line 60 - column 3 line 15). 



Claim Rejections - 35 USC § 103 

2. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 
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Claims 30, 31 , and 34 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Ahmad et al. (US006263507B1). 

Claim 30 contains the limitations of claims 2, 3, and 27 and is analyzed as 
previously discussed with respect to those claims. Furthermore, Official Notice is taken 
that it is well known to provide more than one video images or "plurality of audio-visual 
segments" for each news story. Therefore, it would have been obvious to modify the 
control device and system controller disclosed by Ahmad to provide more than one 
video images in order to provide a more detailed visual summary of the story. 

Regarding claim 31, Ahmad discloses an information map region where the user 
can easily move freely among different news stories of the news program or "topic entry 
point". Each row allows the user to jump from one news story to another and would 
display the associated video images and text related to that story (See Fig. 2A and 2B; 
column 16 lines 3-55). 

Claim 34 contains the limitations of claims 30 and 31 and is analyzed as 
previously discussed with respect to those claims. 

Claims 4-6, 10-12, 16-18, 22-24, 28, 32, 33, and 35 are rejected under 35 U.S.C. 
103(a) as being unpatentable over Ahmad et al. (US006263507B1) in view of Liou et al. 
(US006580437B1). 

Regarding claim 4, Ahmad et al. (Ahmad) discloses a control device and system 
controller or "multimedia summary generator" for use in electronic devices such as 
televisions, a computer display monitor, or "video display system" (See column 1 lines 
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1 0-20 and column 2 lines 60-67). The control unit and system controller is able to 
summarize multiple television news programs or "video programs" within a graphical 
user interface (GUI) or "multimedia summary" (See Fig. 2A and 2B; column 4 lines 40- 
60). The control device and system controller extracts video images or "audio-video 
segments" (See column 16 line 55 - column 17 line 10) and text from a transcript (See 
column 33 line 60 - column 34 line 10) and combines the extracted pieces to provide a 
summary of the television news broadcast within the GUI or "multimedia summary of 
said video program (See Fig. 2A and 2B). Furthermore, the control device and system 
controller extracts video images or also known as "audio-visual template" that relate to 
the news stories or "topics" of the news program (See column 16 line 55 - column 17 
line 10). The news programs have different stories, which are identified within the 
transcript by markers or "topic cue", and each video images represents a news story 
within the GUI (See Fig. 2A and 2B; column 23 lines 30-50). Lastly, the control device 
and system controller executes this process, where inherently it is capable of executing 
computer instructions within a memory coupled thereto. However, Ahmad lacks a 
feature where the system can identify a subtopic cue and select a video image related 
to the subtopic to add to the GUI summary. 

Liou et al. (Liou) discloses a system for organizing videos based on closed- 
caption information. The system constructs a video table of contents. The system builds 
a tree defining the different stories or "topic cues" and the different speakers or 
"subtopic cues" associated with the stories. The system breaks the video up according 
to the closed-caption data and the tree is made up of video shots and text that are 
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associated with each story and speaker (See Fig. 5, 6, and 9; column 1 lines 20-30, 
column 2 lines 40-65, column 6 lines 35-55, column 7 lines 30-60). Therefore, it would 
have been obvious to one with ordinary skill in the art at the time the invention was 
made to modify the control device and system controller disclosed by Ahmad to identify 
speakers or "subtopic cues" within a news story and select a video image or shot 
related to the speaker to add to the GUI summary, as taught by Liou, in order to provide 
a more informative and detailed summary of the news program for the user. 

Regarding claim 5, the control device and system controller performs all the 
functions of the "domain identification application", "topic cue identification application", 
"subtopic cue identification application", and "audio-visual template identification 
application" as discussed in claims 1-4. 

Regarding claim 6, Ahmad discloses an information map region where the user 
can easily move freely among different news stories of the news program or "entry point 
for each topic". Each row allows the user to jump from one news story to another and 
would display the associated video images and text related to that story (See Ahmad 
Fig. 2A and 2B; column 16 lines 3-55). Furthermore, Liou discloses that the organized 
tree of the video table of contents allows users to jump directly to different speakers or 
"entry point for each subtopic" (See Liou Fig. 8 and 9). 

Claim 1 0 contains the limitations of claims 4 and 9 and is analyzed as previously 
discussed with respect to those claims. 

Claim 1 1 contains the limitations of claims 5 and 9 and is analyzed as previously 
discussed with respect to those claims. 
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Claim 12 contains the limitations of claims 6 and 10 and is analyzed as 
previously discussed with respect to those claims. 

Claim 16 contains the limitations of claims 4 and 15 and is analyzed as 
previously discussed with respect to those claims. 

Claim 17 contains the limitations of claims 5 and 15 and is analyzed as 
previously discussed with respect to those claims. 

Claim 18 contains the limitations of claims 6 and 16 and is analyzed as 
previously discussed with respect to those claims. 

Claim 22 contains the limitations of claims 4 and 21 and is analyzed as 
previously discussed with respect to those claims. 

Claim 23 contains the limitations of claims 5 and 21 and is analyzed as 
previously discussed with respect to those claims. 

Claim 24 contains the limitations of claims 6 and 22 and is analyzed as 
previously discussed with respect to those claims. 

Claim 28 contains the limitations of claims 4 and 27 (wherein each speaker has 
one video shot related to the speaker (See Liou Fig. 8 and 9)) and is analyzed as 
previously discussed with respect to those claims. 

Claim 32 contains the limitations of claims 4 and 30 and is analyzed as 
previously discussed with respect to those claims. Furthermore, Official Notice is taken 
that it is well known to provide more than one video images or "plurality of audio-visual 
segments" for each speaker of a the news story. Therefore, it would have been obvious 
to modify the control device and system controller disclosed by Ahmad in view of Liou to 
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provide more than one video images in order to provide a more detailed visual summary 
of where the speaker is in association with the story. 

Claim 33 contains the limitations of claims 6, 31 , and 32 and is analyzed as 
previously discussed with respect to those claims. 

Claim 35 contains the limitations of claims 32 and 33 and is analyzed as 
previously discussed with respect to those claims. 

Claims 36-38 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Ahmad et al. (US006263507B1) in view of Name-It Feature Article and Hoarty et al. 
(US005093718A). 

Regarding claim 36, Ahmad et al. (Ahmad) discloses a control device and system 
controller or "multimedia summary generator" for use in electronic devices such as 
televisions, a computer display monitor, or "video display system" (See column 1 lines 
10-20 and column 2 lines 60-67). The control unit and system controller is able to 
summarize multiple television news programs or "video programs" within a graphical 
user interface (GUI) or "multimedia summary" (See Fig. 2A and 2B; column 4 lines 40- 
60). The control device and system controller extracts video images or "audio-video 
segments" (See column 16 line 55 - column 17 line 10) and text from a transcript (See 
column 33 line 60 - column 34 line 10) and combines the extracted pieces to provide a 
summary of the television news broadcast within the GUI or "multimedia summary of 
said video program (See Fig. 2A and 2B). However, Ahmad lacks a feature where it 
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obtains an image of a face of a person, verifies the identity, and adds the image of the 
face to the GUI summary. 

The article Name-It discloses a method for matching faces in a news video with 
their name or "identity". The method extracts the face of a person from a video and the 
name of the person from the transcript of the video. Then it compares the collected 
information to the face similarity information to confirm that the name is matched to the 
correct face (See page 22 column 1 and 2, page 25 column 1 and 2). Therefore, it 
would have been obvious to one with ordinary skill in the art at the time the invention 
was made to modify the control device and system controller disclosed by Ahmad to 
extract an image of a face of a person and confirm the persons name, as taught by the 
article Name-It, in order to provide a means of correctly identifying the people on the 
video so that the user is aware of who is presenting the story. 

Hoarty et al. discloses an interactive home information system where users can 
obtain information about TV listing for the month. The system provides images of actors 
or scenes from movies and TV shows within the information or "summary" (See column 
7 lines 20-30). Therefore, it would have been obvious to one with ordinary skill in the art 
at the time the invention was made to modify the control device and system controller 
disclosed by Ahmad to add the image of the face to the GUI summary, as taught by 
Hoarty et al., in order to provide a more detailed summary of the story and who is 
presenting the story in one convenient screen. 
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Regarding claim 37, the system confirms the identity of the person by using the 
face similarity information of the person and the name from the transcript as discussed 
in claim 36. 

Claim 38 contains the limitations of claims 36 and 37 and is analyzed as 
previously discussed with respect to those claims. Furthermore, Official Notice is taken 
that it is well known to use voice recognition to identify a person. Therefore, it would 
have been obvious to one with ordinary skill in the art at the time the invention was 
made to modify the control device and system controller disclosed by Ahmad in view of 
Name-It to identify a person using voice recognition in order to provide a more accurate 
means of correctly identifying the person. 

Conclusion 

3. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. Please take note of Lawler et al. (US005907323A) for his similar 
use of a summary panel. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Joseph Ustaris whose telephone number is (703) 305- 
0377. The examiner can normally be reached on Monday-Friday with alternate Fridays 
off from 7:30 A.M. to 5:00 P.M. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Andrew Faile, can be reached on (703) 305-4380. The fax phone number 
for this Group is (703) 872-9306. 

Any inquiry of general nature or relating to the status of this application or 
proceeding should be directed to the Group Receptionist whose telephone number is 
(703) 305-4700. 





JGU 

March 31,2004 



VIVEK SRIVASTAVA 
PRIMARY EXAMINER 



